350 results found.
Written
Corpus,
Language Type:
Multilingual
Languages:
German Spanish Swedish french
Availability:
Freely Available
License:
Free for academic research
Size:
3 billion (per language) <Not Specified>Production Status:
Newly created-in progress
Use:
<Not Specified>
-
Paper title:Building Large Corpora from the Web Using a New Efficient Tool Chain
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Roland Schäfer | <Not Specified> | None |
| Author 2 | Felix Bildhauer | <Not Specified> | None |
| Main Contact | Roland Schäfer | Freie Universität Berlin | DE |
Documentation:
Yet to be documented
Written
Corpus,
Language Type:
Multilingual
Languages:
Czech English German Russian french
Availability:
Freely Available
License:
Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International
Size:
122 MByte Production Status:
Existing-used
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Multimodal Pivots for Image Caption Translation
-
Paper track:Empirical/Data-Driven
-
Paper status:Accept - Outstanding
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Julian Hitschler | Computational Linguistics, University of Heidelberg | DE |
| Author 2 | Shigehiko Schamoni | Heidelberg University | DE |
| Author 3 | Stefan Riezler | Heidelberg University | DE |
| Main Contact | Stefan Riezler | Heidelberg University | None |
Documentation:
http://www.casmacat.eu/corpus/news-commentary.html
Written
Corpus,
Language Type:
Multilingual
Languages:
English German Mandarin Chinese Russian french
Availability:
Freely Available
License:
CreativeCommons
Size:
81,8 MByte Production Status:
Newly created-finished
Use:
Corpus Creation/Annotation
-
Paper title:Concepticon: A Resource for the Linking of Concept Lists
-
Paper track:Evaluation
-
Paper status:Accept Poster+DemoSuggested
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Johann-Mattis List | Centre des Recherches Linguistiques sur l'Asie Orientale | FR | Philipps-University Marburg | DE |
| Author 2 | Michael Cysouw | Philipps-University Marburg | DE | Philipps-University of Marburg | DE |
| Author 3 | Robert Forkel | Max Planck Institute for the Science of Human History | DE | ||
| Main Contact | Johann-Mattis List | Centre des Recherches Linguistiques sur l'Asie Orientale | None | Max Planck Institute for the Science of Human History | None |
Documentation:
http://concepticon.clld.org
Written
Corpus,
Language Type:
Multilingual
Languages:
English Portuguese Spanish french
Availability:
Freely Available
License:
<Not Specified>
Size:
<Not Specified> sentences Production Status:
<Not Specified>
Use:
Machine Translation, SpeechToSpeech Translation
-
Paper title:Parallel Corpora for the Biomedical Domain
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Aurélie Névéol | LIMSI-CNRS | FR |
| Author 2 | Antonio Jimeno Yepes | IBM | AU |
| Author 3 | Mariana Neves | Hasso Plattner Institute | DE |
| Author 4 | Karin Verspoor | The University of Melbourne | AU |
| Main Contact | Aurélie Névéol | LIMSI-CNRS | None |
Documentation:
<Not Specified>
Written
Ontology,
Language Type:
Multilingual
Languages:
English German Spanish french italian
Availability:
Freely Available
License:
CreativeCommons
Size:
1500000 concepts Production Status:
Existing-used
Use:
Knowledge Discovery/Representation
-
Paper title:Cross-lingual Knowledge Projection Using Machine Translation and Target-side Knowledge Base Completion
-
Paper track:NLP engineering experiment paper
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Naoki Otani | Carnegie Mellon University | US |
| Author 2 | Hirokazu Kiyomaru | Kyoto University | N/A |
| Author 3 | Daisuke Kawahara | Kyoto University | JP |
| Author 4 | Sadao Kurohashi | Kyoto University | JP |
| Main Contact | Naoki Otani | Carnegie Mellon University | None |
Documentation:
English documentation is publicly available: https://github.com/commonsense/conceptnet5/wikiLanguage Type:
Multilingual
Languages:
Basque German Spanish french italian
Availability:
From Data Center(s)
License:
META-SHARE and/or CC
Size:
1040 Production Status:
Newly created-in progress
Use:
Speech Recognition/Understanding
-
Paper title:SAVAS: Collecting, Annotating and Sharing Audiovisual Language Resources for Automatic Subtitling
-
Paper track:Speech
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Arantza del Pozo | Vicomtech-IK4 | ES |
| Author 2 | Carlo Aliprandi | Synthema | IT |
| Author 3 | Aitor Álvarez | Vicomtech-IK4 | ES |
| Author 4 | Carlos Mendes | VoiceInteraction | PT |
| Author 5 | Joao P. Neto | VoiceInteraction | PT |
| Author 6 | Sérgio Paulo | VoiceInteraction | PT |
| Author 7 | Nicola Piccinini | Synthema | IT |
| Author 8 | Matteo Raffaelli | Synthema | IT |
| Main Contact | Arantza del Pozo | Vicomtech-IK4 | None |
Documentation:
Not available yetLanguage Type:
Multilingual
Languages:
English German Russian french
Availability:
Freely Available
License:
CC-BY
Size:
3500000 sentences Production Status:
Existing-used
Use:
Evaluation/Validation
-
Paper title:A Multilingual Dataset for Evaluating Parallel Sentence Extraction from Comparable Corpora
-
Paper track:Written
-
Paper status:Accept Poster
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Pierre Zweigenbaum | LIMSI-CNRS | FR |
| Author 2 | Serge Sharoff | University of Leeds | GB |
| Author 3 | Reinhard Rapp | Aix-Marseille Université | FR |
| Main Contact | Pierre Zweigenbaum | LIMSI-CNRS | None |
Documentation:
http://aclweb.org/anthology/W17-2512.pdf
Written
Corpus,
Language Type:
Multilingual
Languages:
American English German Spanish french italian
Availability:
From Owner
License:
<Not Specified>
Size:
<Not Specified> Production Status:
Newly created-finished
Use:
Information Extraction, Information Retrieval
-
Paper title:Extending HeidelTime for Temporal Expressions Referring to Historic Dates
-
Paper track:Written
-
Paper status:Accept Poster+Demo
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Jannik Strötgen | Heidelberg University | DE | Max-Planck-Institut für Informatik | DE |
| Author 2 | Thomas Bögel | Institute of Computer Science, Heidelberg University | DE | ||
| Author 3 | Julian Zell | Heidelberg University | DE | ||
| Author 4 | Ayser Armiti | Heidelberg University | DE | ||
| Author 5 | Tran Van Canh | Heidelberg University | DE | ||
| Author 6 | Michael Gertz | Heidelberg University | DE | ||
| Main Contact | Jannik Strötgen | Max-Planck-Institut für Informatik | None | Bosch Center for Artificial Intelligence | None |
Documentation:
http://code.google.com/p/heideltime/Language Type:
Multilingual
Languages:
Dutch English Spanish french italian
Availability:
Freely Available
License:
OpenSource
Size:
5 * 5000 Production Status:
Newly created-finished
Use:
Sentiment Analysis
-
Paper title:Generating Polarity Lexicons with WordNet propagation in 5 languages
-
Paper track:Written
-
Paper status:Accept Oral
| Author Number | Name | Affiliation | Country | ||
|---|---|---|---|---|---|
| Author 1 | Isa Maks | VU University Amsterdam | NL | ||
| Author 2 | Ruben Izquierdo | VU University | NL | ||
| Author 3 | Francesca Frontini | PRAXILING - Université Paul-Valéry Montpellier 3 | FR | ILC CNR - Pisa Italy | FR |
| Author 4 | Rodrigo Agerri | IXA NLP Group, University of the Basque Country (UPV/EHU) | ES | ||
| Author 5 | Piek Vossen | VU University Amsterdam | NL | ||
| Author 6 | Andoni Azpeitia | vicomtech | ES | ||
| Main Contact | Isa Maks | VU University Amsterdam | None |
Documentation:
yes
Written
Lexicon,
Language Type:
Multilingual
Languages:
English Finnish Japanese Portuguese french
Availability:
Freely Available
License:
GNU LGPL 2.1
Size:
More than 2 million entries Production Status:
Existing-updated
Use:
General Lexical Resource
-
Paper title:Attaching Translations to Proper Lexical Senses in DBnary
-
Paper track:long paper
-
Paper status:Accept
| Author Number | Name | Affiliation | Country |
|---|---|---|---|
| Author 1 | Andon Tchechmedjiev | <Not Specified> | FR |
| Author 2 | Gilles Sérasset | Univ Grenoble Alpes | None |
| Author 3 | Jérôme Goulian | Univ Grenoble Alpes | None |
| Author 4 | Didier Schwab | Univ Grenoble Alpes | None |
| Main Contact | Andon Tchechmedjiev | IMT Mines Alès | None |
Documentation:
http://dbnary.forge.imag.fr/ Documentaion in English




